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Detecting the risk of cardiovascular diseases by detecting mutations in genes, 
including genes encoding ot2b-adrenoceptor and apolipoprotein B 

The present invention provides a method of identifying subject's susceptibility to 
5 cardiovascular diseases or risk of developing myocardial infarction (MI) or 

cerebrovascular stroke by detecting gene polymorphisms and other gene mutations from a 
biological sample of the subject and optionally obtaining information concerning the 
family and medical history, blood, serum, plasma, urinary analytes and clinical findings of 
the subject. The invention also provides a multivariate model, a combination or algorithm 
10 of variables which best describes the probability of cardiovascular diseases, especially MI 
and stroke. The invention also relates to a test kit and software for accomplishing the 
method. 

1 5 FIELD OF THE INVENTION 

The present invention is generally directed to a method for assessing the risk of 
myocardial infarction (MI) and cerebrovascular stroke in an individual, such as a human. 
Specifically, the invention is directed to a method that utilises both genetic and phenotypic 

20 information as well as information obtained by questionnaires to construct a score that 
provides the probability of developing an MI or stroke. Furthermore, the invention 
provides a kit for carrying out the method. The kit can be used to set an etiology-based 
diagnosis of cardiovascular diseases for targeting of treatment and preventive 
interventions, such as dietary advice as well as stratification of the subject in clinical trials 

25 testing drugs and other interventions. 

BACKGROUND OF THE INVENTION 

The coronary heart disease (CHD) and cerebrovascular disease are multifactorial diseases 
30 and the leading causes of morbidity, death and disability globally. Even though the age- 
standardized incidence of and mortality from CHD and stroke are still declining in the 
Western world, the number of cardiovascular events and subsequent hospitalizations and 
expenditure are increasing, due to the elevation of life expectancy of the population. It has 
been estimated based on twin and migration studies that the heritability of CHD and stroke 
35 is of the order of 50-60% and there are no major gene effects. Thus, multiple genes and 
non-genetic risk factors contribute to the development and progression of CHD. Different 
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clinical manifestations of CHD (i.e. angina pectoris, myocardial infarction, sudden death) 
and stroke have overlapping but also somewhat distinct pathophysiology and risk factors. 

CHD and stroke may be caused in different individuals by different reasons and through 
5 different pathophysiologic pathways. Often, however, the same risk factors and pathways 
are operating, but their importance for each individual varies. Regarding pathophysiology, 
CHD and stroke may be caused by obstruction of the coronary (cerebrovascular) arteries, 
vasoconstriction or vasospasm in these, thrombotic phenomena or arrhythmias. Coronary 
and cerebrovascular arterial obstruction is most often caused by atheroma formation. This 
10 is a complex disease, but lipids and their metabolism such as oxidation plays a key role. 
Other major factors leading to atheroma formation are tobacco smoking, hypertension, 
diabetes, obesity and hyperhomocysteinemia. Additional risk factors include elevated 
coagulation factors, platelet activation and decreased nitric oxid availability. Men, older 
persons and those with a family history of CHD are at elevated risk. 

15 

Persons who have mutations in genes regulating lipids, their metabolism, blood pressure, 
platelet functions, coagulation, fibrinolysis, homocysteine metabolism and the function of 
the cardiac muscle can be expected to be at an elevated risk of CHD. Assessing a number 
of these mutations can be used to predict MI and cerebrovascular stroke.. 

20 

A number of meta-analyses have studied multivariate risk functions from diverse 
populations in the prediction of CHD. None of these have concerned the effects of specific 
genotyped gene mutations. A recent meta-analysis concerned ordering risk, magnitude of 
relative risks, and estimation of absolute risk in prospective cohort studies (Diverse 

25 Populations Collaborative Group 2002). The outcome measure was death from CHD. The 
analysis included 105 420 men and 56 535 women 35-74 years of age and free of CHD at 
baseline from 16 observational studies with a total of 27 analytical groups. The area under 
the receiver operating characteristic curve (AUC) was used to judge the ability of the 
multivariate risk function to order risk correctly. The AUCs differed significantly between 

30 the studies (p < 0.01) but were very similar for different risk functions applied to the same 
population, indicating similar ability to rank risk for different models. The magnitudes of 
the relative risks associated with major risk factors (age, systolic blood pressure, serum 
total cholesterol, smoking, and diabetes) varied significantly across studies (p < 0.05 for 
homogeneity). The prediction of absolute risk was not very accurate in most of the cases 
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when a model derived from one study was applied to a different study. The authors 
concluded that when considered qualitatively, the major risk factors are associated with 
CHD mortality in a diverse set of populations. 

5 The new Sheffield table and modified joint British societies coronary risk prediction (JBS) 
chart are widely used (Rabindranath et al. 2002). The JBS chart approximates age and 
systolic blood pressure, and the new Sheffield table dichotomises blood pressure, and these 
simplifications may lead to diagnostic inaccuracy. Methods: The diagnostic performance 
of the charts against an individualised laboratory based CHD risk calculation in 1 102 
10 subjects in primary care were evaluated and compared. The new Sheffield table and 
modified JBS chart performed equally well. 

Most previously used models used to predict individual risk of death from coronary heart 
disease (CHD) were developed from data of three decades ago from the Framingham Heart 

15 Study. CHD mortality rates have declined markedly since that period as a result of 

improvement in both risk factor status and medical interventions. Generalization of the 
results from this one study to the population at large remains a matter of concern. Liao and 
coworkers (1999) compared predictive functions derived from the major risk factors for 
CHD from Framingham and two more recent American cohorts, the First and Second 

20 National Health and Nutrition Examination Survey (NHANES I and NHANES II). The 

participants included 1846 men and 2323 women 35 to 69 years of age and free of CHD at 
the fourth examination (1954 to 1958) from the Framingham Study; 2753 men and 3858 
women from the NHANES I (1971 to 1975); and 2655 men and 3050 women from 
NHANES II (1976 to 1980). The three cohorts were monitored for 24, 20, and 15 years, 

25 respectively. Significant heterogeneity existed among studies in the magnitude of the Cox 
coefficients for the individual factors (ie, age, systolic blood pressure, serum total 
cholesterol, and smoking status), especially among men. When risk factors were 
considered collectively, however, functions derived from and applied to different cohorts 
had a similar ability to rank individual risk. The areas under the receiver operating 

30 characteristic curves were 0. 71 to 0.76 in men and 0.76 to 0.81 in women when different 
risk functions were applied to their own population or to a second population. The 
cumulative CHD survival observed in women in the two cohorts was close to what was 
predicted from the Framingham equation. The authors concluded that the Framingham risk 
model for the prediction of CHD mortality rates provides a reasonable rank ordering of risk 
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for individuals in the US white population for the period 1975 to 1990. However, 
prediction of absolute risk is less accurate. 

SUMMARY OF THE INVENTION 

5 

The object of the present invention is a method of identifying the risk of cardiovascular 
diseases, especially MI and stroke, by detecting gene polymorphisms and other gene 
mutations from a biological sample of the subject. The information obtained from this 
method can be combined with other information concerning an individual, e.g. results from 

10 blood measurements, clinical examination and questionnaires. The genetic information 
includes data on mutations in genes associated with MI and/or stroke. The blood 
measurements include the determination of blood or plasma or serum analytes that predict 
CHD or stroke such as blood lipid, homocysteine, glucose, and insulin concentrations and 
urinary excretion of nicotine metabolites. The information to be collected by questionnaire 

15 includes information concerning gender, age, family and medical history and health-related 
habits such as smoking. Clinical information collected by examination includes e.g. 
information concerning height, weight, hip anf waist circumference, systolic and diastolic 
blood pressure, heart rate, other electro-/audiographic parameters and maximal oxugen 
uptake. 

20 

The invention particularly provides a method for diagnosing a susceptibility to 
cardiovascular disease especially myocardial infarction (MI) and stroke in a subject 
by detecting genetic variation or polymorphism, i.e. a mutation, in at least three of 
the genes selected from the group consisting of: 

25 (a) cc2B-adrenoceptor 

(b) apolipoprotein B 

(c) dimethylarginine dimethylaminohydrolase 1 

(d) fibrinogen-beta 

(e) neuropeptide Y 

30 (f) natriuretic peptide precursor A 

(g) cystathione beta synthase 

(h) glycoprotein Ilb/IIIa 

(i) lipoprotein lipase 
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comprising the steps of: 

i) providing a biological sample of the subject to be tested, 

ii) detecting the presence of mutations in the genes, the presence of a mutation in one 

5 or several of the genes indicating an increased risk of coronary heart disease (CHD) and/or 
myocardial infarction (MI) in said subject. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS OF THE 
INVENTION 

10 

In a preferred embodiment the invention comprises the combination of information from a 
large number of variables (measurements) to predict the probability of MI and stroke. The 
predictor information includes an assessment of genotypes and haplotypes in genomic 
DNA and optionally data obtainable by interviews, questionnaires, clinical examination 
1 5 and/or blood analyte measurements. This predictor information can be collected in any age. 
This method is also applicable to middle-aged persons. 

Information concerning genomic DNA genotypes concerns polymorphisms such as single 
nucleotide polymorphisms (SNPs) and mutations in e.g. the following genes (OMIM 

20 abbreviations): APOA1, APOA2, APOA4, APOB, APOC1, APOC2, APOC3, APOC4, 
APOD, APOE, ARG, LDLR, OLR1, MSR1, MSR2 , LPA, LPL, LIPC, LIPG, CETP, ETL, 
GPIIIa, ICAM1, ICAM2, ICAM3, SELL, SELE, MMP1, MMP3, ITGB n, ADD1, ADD2, 
ADD3, NPY, NPY1R, NPY2R, NPY3R, NPY4R, NPY5R, HFE1, HFE2, HFE3, TFRC, 
TFR2, PON1, PON2, SOD1, SOD2, SOD3, CAT, GSTM1, GSTM2, GSTM3, GSTP1, 

25 GPX1, GPX3, TNFA, TNFB, TRX, NOS3, NOS3, DDAH1, DDAH2, ADRB1, ADRB2, 
ADRB3, F2, F5, F7, F8, F13, VWF, PAI1, PAI2, FGA, FGB, FGG, ACE, AGT, AGTR1, 
ATG, SCAP, SCNN1 A, SCNN1B, NPPA, CBS, MTHFR, or any other candidate genes 
that will be observed to relate to the susceptibility to MI or stroke. 

30 The data that can be obtained by questionnaire, interview or clinical examination includes 
information concerning: 

1) age, 

2) gender, 
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3) medical history, i.e. prevalent diseases, 

4) family history, i.e. diseases of parents and siblings, 

5) tobacco smoking, 

6) alcohol use, 

5 7) physical activity and exercise, 

8) high weight or obesity in childhood and adolescence, 

9) personality traits such as depression, anxiety, hostility, 

10) psychological and mood states such as anger, irritability, 

11) low self-esteem or weak self-image, 

10 12) lack of social skills, social isolation, lack of social networks, 

13) self-image promoting alcohol use (e.g. easy-taking), 

14) adulthood socioeconomic circumstances (e.g. being single, divorced or widowed as 
the marital status, posessing no phone, low socioeconomic status, unemployment 
and urban place of residence, 

15 15) stressful life events, 

1 6) coping styles, coping capacity, anger control, 

1 7) history of diabetes, 

1 8) high perceived cardiovascular risk, 

19) high amount of hospitalizations, poor health status, 
20 20) blood pressure, heart rate, maximal oxygen uptake, 

21) other relevant information that can be collected by self-administered questionnaire, 
by an interview or by clinical examination of the subject. 

Information obtainable by measurements from blood, blood cell, plasma, serum or urine 
25 samples includes: 

1) serum or plasma cholesterol, HDL and LDL cholesterol, 

2) serum or plasma triglycerides, 

3) serum or plasma apolipoproteins, 

4) serum or plasma insulin concentration, 
30 5) blood or serum glucose concentration, 

6) blood hemoglobin concentration, 

7) serum ferritin or transferring receptor concentrations, 

8) serum fibrinogen and other coagulation factor concentration, 

9) measurement of platelet activation, aggregation and/or adhesion, 
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10) serum or plasma concentrations of inflammatory markers such as CRP, 

11) other relevant information that can be obtained by chemical or biochamical 
measurements. 

5 Numerous genotyping methods have been described in the art for analysing nucleic acids 
for the presence of specific sequence variations e.g. SNP's, insertions and deletions (for 
review see Syvanen 2001 and Nedelcheva Kristensen et al. 2001). In these methods a 
sample containing nucleic acid (e.g. blood, tissue biopsy or buccal cells) is obtained from 
the patient and the sequence variations of interest are identified and visualised from the 
10 nucleic acids. 

Allelic variants in genes can be discriminated by enzymatic methods (with the aid of 
restriction endonucleases, DNA polymerases, ligases etc.), by electrophoretic methods 
(e.g. single strand conformation polymorphism (SSCP), heteroduplex analysis, fragment 

1 5 analysis and DNA sequencing), by solid-phase assays (dot blots, microarrays, 

microparticles, microtiter plates etc.) and by physical methods (e.g. hybridisation analysis, 
mass spectrometry and denaturing high performance liquid chromatography (DHPLC)). In 
most of the genotyping assays different polymerase chain reaction (PCR) applications are 
used both to increase the signal to noise ratio as well as spare sample nucleic acid before 

20 allele discrimination. Detectable labels (fluorochromes, radioactive labels, biotin, modified 
nucleotides, haptens etc) can be used to enhance visualization of allelic variants. 

This invention is based on the principle that a small number of genotypings are performed, 
and the mutations to be typed are selected on the basis of their ability to predict MI and/or 
25 stroke. For this reason any method to genotype mutations in a genomic DNA sample can 
be used. If non-parallel methods such as real-time PCR are used, the typings are done in a 
row. The PCR reactions may be multiplexed or carried out separately in a row or in 
parallel aliquots. 

30 The score that predicts the probability of MI or stroke may be calculated using a 
multivariate failure time model or a logistic regression equation as follows: 
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Probability of a cardiovascular disease = [1 + e (_( " a + z ( b, * Xl »j wherein e is Napier's 
constant, Xi are variables related to the cardiovascular disease, b\ are coefficients of these 
variables in the logistic function, and a is the constant term in the logistic function. The 
model may additionally include any interaction (product) or terms of any variables Xj, e.g. 
5 bjXi. An algorithm is developed for combining the information to yield a simple prediction 
of MI as percentage of risk in 10 years. An alternative statistical model is a failure-time 
model such as the Cox's proportional hazards' model. 

EXPERIMENTAL SECTION 
10 Determining individual genotypes with SNaPShot 

The method according to the invention for the determination of the allelic pattern of the 
codons/mutations in question can be carried out with polymerase chain reaction (PCR) in 
combination with, for example, an allele specific primer extension method (SNaPshot, 
1 5 Applied Biosystems) or DNA fragment analysis followed by capillary electrophoresis with 
ABI Prism 3 100 Genetic Analyzer (Applied Biosystems). 

In a SNaPshot reaction the genomic DNA region containing the mutation is question is 
amplified with PCR. The amplified PCR reaction is purified and the product is used as a 
20 template in SNaPshot reaction. 

For the SNaPshot reaction an extension primer that ends one nucleotide 5' of a given 
single nucleotide polymorphism (SNP) locus is designed. In the SNaPshot reaction the 
extension primer binds to its complementary template in the presence of fluorescent 
25 labelled dideoxy-NTPs ([FJddNTPs) and DNA polymerase. The polymerase extends the 
primer by only one nucleotide, adding a single [F]ddNTP to its 3' end. In the analysed data 
nucleotide A is seen in green colour, C is seen in black colour, G is seen in blue colour and 
T in red colour. If for example the genotype is A/A then only green colour is detected. For 
a heterozygous A/C green and black colour are detected. 

30 

When multiple SNPs are determined in the same reaction, the extension primers need to 
differ significantly in length (4-6 nucleotides) to avoid overlap between the final SNaPshot 
products. This can be accomplished by adding a variable number of nucleotides dT, dA, 
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dC or cGATC to the 5' end of the different extension primers. The different SNPs can then 
be detected in the capillary electrophoresis according to the different size of the SNaPshot 
product. To perform SnaPshot genotyping under standard conditions, refer to the user 
manual (ABI Prism SnaPshot Multiplex kit, Protocol, Applied Biosystems). 

5 

In the DNA fragment analysis, a fluorescent lable is attached to the 5' end of the PCR 
primer. In the DNA fragment analysis, the alleles of the locus to be genotyped are different 
in length (i.e. there is a deletion or an insertion of known number of nucleotides in the 
studied locus). The different alleles can then be detected after the capillary electrophoresis 
10 due to the different migration rates of the different lengths of the per product (i.e. alleles). 

Polymerase chain reaction (PCR^ 

The genomic DNA regions containing the mutations in question can be amplified with 
15 PCR either in separate reactions or all in one single reaction mix (i.e. multiplex PCR) with 
PTC-220 DNA Engine Dyad PCR machine (MJ Research). The PCR amplification was 
conducted in a 20 ptl volume: the reaction mixture contained 60 ng human genomic DNA 
(extracted from peripheral blood), IX PCR Buffer (QIAGEN), 100 \il of each of the 
nucleotides (dATP, dCTP, dGTP, dTTP), 0.5 nM of each primers and 1 unit of the DNA 
20 polymerase (QIAGEN, Hot Start Taq DNA polymerase). The PCR conditions need to be 
determined experimentally, and the following standard protocol can be used as a start: first 
the reaction was hold 10 minutes at 94°C, then the following three steps were repeated for 
35 cycles: 45 seconds at 94°C, 45 seconds at 55°C, 1 minute 30 seconds at 72°C, after 
which the reaction was kept at 72°C for an additional 5 minutes and finally hold at 4°C. 

25 

APOB Thr98Ile (also known as APOB ThrTlIle) 

The nucleotide sequence of the primer pair for the amplification of human APOB gene 
(apolipoprotein B gene) Thr98Ile mutation (SEQ ID NO:l) (SEQ ID NO:3) (also known as 
Thr71Ile mutation) was as follow: 5'- GAC AAC CTC AAT GCT CTG CT -3' (SEQ ID 
30 NO:5) and 5'- TGA CTT ACC TGG ACA TGG CT -3' (SEQ ID NO:6). 

NPPA Val32Met 

The nucleotide sequence of the primer pair for the amplification of human NPPA 
(natriuretic peptide precursor A gene) (gene is also known as ANF or ANP or PND or 
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Pronatriodilatin (atrial natriuretic peptide)) Val32Met mutation (SEQ ID NO: 8) (SEQ ID 
NO:10) was as follow: 5'- GCC AAG AGA GGG GAA CCA GAG -3 s (SEQ ID NO:X12) 
and 5'- AGT GAG CAC AGC ATC AGA AAG C-3' (SEQ ID NO: 13). 

5 DDAH1 IYS2-33QT 

The nucleotide sequence of the primer pair for the amplification of human DDAH1 
(dimethylarginine dimethylaminohydrolase 1) IVS2-33C>T mutation (in the following 
sequences SEQ ID NO: 15 and SEQ ID NO: 16 the IVS2-33C>T polymorphism is located 
at the position 1041) was as follows: 5'- ATC CTG CTT TCT GCC CTT T -3' (SEQ ID 
10 NO: 17) and 5'- AAG CCA GTG AAG CGT AAA CAC-3' (SEQ ID NO: 18). 

FGB -455G>A 

The nucleotide sequence of the primer pair for the amplification of human FGB gene 
(fibrinogen-beta gene) promoter mutation -455G>A mutation (In the following sequences 
15 SEQ ID NO:20 and 21 the FGB -455G>A polymorphism is located at the position 1437) 
(SEQ ID NO:20) (SEQ ID NO:21) was as follow: 5'- AAC ACA CAA GTG AAC AGA 
CAA G-3' (SEQ ID NO:22) and 5'- GCA CTC CTC AAA GAG AGA TG -3' (SEQ ID 
NO:23). 

20 NPY -52QG 

The nucleotide sequence of the primer pair for the amplification of human NPY gene 
(neuropeptide Y gene) -52 C>G mutation (in the following sequences SEQ ID NO:25 and 
26 the NPY -520G polymorphism is located at the position 1000) (SEQ ID NO:25) 
(SEQ ID NO:26) was as follow: 5'- GTT CTC TCT GCG GGA CTG GG-3' and (SEQ ID 
25 NO:27) 5'- CTG CCC TGG GAT AGA GCG AA-3' (SEQ ID NO:28). 

CBS Ile278Thr 

The nucleotide sequence of the primer pair for the amplification of human CBS gene 
(cystathionine-beta-synthase gene) Ile278Thr mutation (SEQ ID NO:36, SEQ ID NO:38) 
30 was as follow: 5'- GAG CCT GGG TTC TTG GGT TTC -3' (SEQ ID NO:40) and 5'- 
GGT TGT CTG CTC CGT CTG GTT -3' (SEQ ID NO:41). 



LPL Asn3 1 8Ser (also known as LPL Asn29 1 Ser mutation^ 
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The nucleotide sequence of the primer pair for the amplification of human LPL gene 
(lipoprotein lipase gene) Asn318Ser mutation (SEQ ID NO:43) (SEQ ID NO:45) (also 
known as LPL Asn291Ser mutation) was as follow: 5'- CGC TCC ATT CAT CTC TTC 
ATC G -3' (SEQ ID NO:47) and 5'- CCC CCT ATC AAC AGA AAC ACC A -3' (SEQ 
5 ID NO:48). 

ITGB3 Leu59Pro (also known as Leu33Pro mutation) 

The nucleotide sequence of the primer pair for the amplification of human ITGB3 
(integrin, beta 3, (platelet glycoprotein Ilia, antigen CD61) Leu59Pro mutation (SEQ ID 
10 NO:50) (SEQ ID NO:52) (also known as Leu33Pro mutation) was as follow: 5'- GCA 
GGA GGT AGA GAG TCG CCA -3' (SEQ ID NO:54) and 5'- GGG CAC AGT TAT 
CCT TCA GCA-3' (SEQ ID NO:55). 

NPPA OPA152Arg 

15 The nucleotide sequence of the primer pair for the amplification of human NPPA 
(natriuretic peptide precursor A gene) (gene is also known as ANF or ANP or PND or 
Pronatriodilatin (atrial natriuretic peptide)) OPA152Arg mutation (SEQ ID NO:57) (SEQ 
ID NO:59) was as follow: 5'- TTA GCA GTT CAT ATT CCT CCC C -3' (SEQ ID 
NO:61) and 5'- AGC CTC TTG CAG TCT GTC CC -3' (SEQ ID NO:62). 

20 

Purification of the PCR products for SNaPshot reaction 

The PCR products were purified with SAP (Shrimp Alkalinen Phosphatase, USB 
Corporation) and Exol (Exonuclease I, USB Corporation) treatment. This was done to 
25 avoid the participation of the unincorporated dNTPs and primers from the PCR reaction to 
the subsequent primer-extension reaction. More specifically 5 units of SAP and 2 units of 
Exol were added to 15 /xl of the PCR product. Reaction was mixed and incubated at 37°C 
for 1 hour. After that the reaction was incubated at 75°C for 15 minutes to inactivate the 
enzymes and afterwards kept at 4°C. 

30 

Primer extension reaction (SNaPshot reaction*) 

In the subsequent primer extension reaction (SNaPshot reaction) 5 /xl of SNaPshot 
Multiplex Ready Reaction Mix (Applied Biosystems), 3 jul of purified PCR products, 1 fil 
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of pooled extension primers (depending of the signal in the SNaPshot reaction, the primer 
concentrations in the mix can range between 0.05 jaM and 1 ^M) and 1 /d water are mixed 
in a tube. The reaction is incubated at 94 °C for 2 minutes and then subject to 25 cycles of 
95°C for 5 s, 50°C for 5 s and 60°C for 5 s in a PTC-220 DNA Engine Dyad PCR machine 
5 (MJ Research). 

The nucleotide sequence of the extension primer for the genotyping of human APOB 
Thr71Ile mutation in a SNaPShot reaction was as follow: 5'- TTT TTT TTT TTT TGA 
AGA CCA GCC AGT GCA -3' (SEQ ID NO:7). 

10 

The nucleotide sequence of the extension primer for the genotyping of human NPPA 
Val32Met mutation in a SNaPShot reaction was as follow: 5'- TT TTT TTT TTT TTT 
TTT AAT CCC ATG TAC AAT GCC -3' (SEQ ID NO: 14). 

1 5 The nucleotide sequence of the extension primer for the genotyping of the human DDAH1 
IVS2-330T mutation in a SNaPShot reaction was as follow: 5'- T TTT TTT TTT TTT 
TTT TTT TTT GTA C AG TCA CTG GTG CCA -3 ' (SEQ ID NO: 1 9). 

The nucleotide sequence of the extension primer for the genotyping of human FGB 
20 promoter -455G>A mutation in a SNaPshot reaction was as follow: 5'- TTT TTT TTT 
TTT TTT TTT TTT TTT TTT TTC TAT TTC AAA AGG GGC-3' (SEQ ID NO:24). 

The nucleotide sequence of the extension primer for the genotyping of human NPY gene - 
52 C>G mutation in a SNaPShot reaction was as follow: 5'- T TTT TTT TTT TTT TTT 
25 TTT TTT TTT TTT TTT GAG GAG GGA AGG TGC TGC G -3 * (SEQ ID NO:29). 

The nucleotide sequence of the extension primer for the genotyping of human LPL 
Asn291Ser mutation was as follow: 5'- TTT TTT TTT TTT TTT TTT TTT TTT TTT TTT 
TTT TTT TCT TTT GGC TCT GAC TTT A -3' (SEQ ID NO:49) 

30 

The nucleotide sequence of the extension primer for the genotyping of human ITGB3 
Leu33Pro mutation was as follow: 5'- TT TTT TTT TTT TTT TTT TTT TTT TTT TTT 
TTT TTT TTT TTT GTC AC A GCG AGG TGA GCC C -3' (SEQ ID NO: 56). 
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The nucleotide sequence of the extension primer for the genotyping of human NPPA 
OPA1 52Arg mutation was as follow: 5'- T TTT TTT TTT TTT TTT TTT TTT TTT TTT 
TTT TTT TTT TTT TTT TTT CTC CCT GGC TGT TAT CTT C -3 * (SEQ ID NO: 63). 

5 Post-extension treatment 

After the primer extension reaction 1 unit of SAP was added to the reaction mix and the 
reaction was incubated at 37°C for 1 hour. The enzyme was inactivated by incubating the 
reaction mix at 75°C for 15 minutes. Afterwards the samples were placed at 4°C. The 
10 post-extension treatment was done to prevent the unincorporated fluorescent ddNTPs 
obscuring the primer extension products (SNaPshot products) during electrophoresis with 
ABI Prism 3100 Genetic Analyzer. 

DNA fragment analysis of ADRA2B insertion/deletion polymorphism 

15 

ADRA2B insertion/deletion mutation 

ADRA2B gene (alpha2B-adrenergic receptor gene) insertion/deletion polymorphism (SEQ 
ID NO:30) (SEQ ID NO:32) was as follows 5'- GGG TGT TTG TGG GGC ATC TC -3' 
(SEQ ID NO:34) and 5'- TGG CAC TGC CTG GGG TTC A -3' (SEQ ID NO:35). A 
20 fluorescent label has been added to the 5' end of one of the above mentioned per primers. 
Thus, the per fragment is detectable in the capillary electrophoresis conducted with ABI 
Prism 3100 Genetic Analyzer. 

The insertion/deletion polymorphism of ADRA2B gene concerns an insertion or an 
25 deletion of three glutamic acids in the region of 12 Glu aminoacids in the codons 298-309 
(SEQ ID NO:30). Thus depending on the genotype, there is either 9 Glu (deletion) or 12 
Glu (insertion) at the ADRA2B locus. Depending on whether the amplified allele had an 
insertion or a deletion in the studied locus, the size of the per product was 91 bp (insertion 
allele) or 82 bp (deletion allele). Thus, for homotzygotes (insertion/insertion or 
30 deletion/deletion) only one size of a fragment was detected either 91 bp or 82 bp, 
respectively. For heterotzygotes both of the above mentioned fragments were detected. 
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Capillary electrophoresis with ABI Prism 3 1 00 Genetic Analyzer 

Aliquots of 1 ill of pooled SNaPshot products, 0.5-1.0 jxl of the ADRA2B 
insertion/deletion per product, 9.00 pi of Hi-Di formamide (Applied Biosystems) and 0.25 
fil GeneScan-120 LIZ size standard (Applied Biosystems) were combined in a 96-well 
5 3100 optical microamp plate (Applied Biosystems). The reactions were denatured by 
placing them at 95°C for 5 minutes and then loaded onto a ABI Prism 3100 Genetic 
Analyzer (Applied Biosystems). Electrophoresis data was processed and the genotypes 
were visualized by using the GeneScan Analysis version 3.7 (Applied Biosystems). 

10 Testing the Risk of MI and stroke 

Risk factors for MI and stroke were studied in the KIHD cohort. Briefly, the "Kuopio 
Ischaemic Heart Disease Risk Factor Study" (KIHD) is a prospective population study in 
men in Eastern Finland (Salonen 1988, Tuomainen et al. 1999). The study protocol for 

1 5 KIHD was approved by the Research Ethics Committee of the University of Kuopio. The 
study sample comprised men from Eastern Finland aged 42, 48, 54 or 60 years. A total of 
2682 men were examined during 1984-89. All participants gave a written informed 
consent. The follow-up of coronary and cerebrovascular events was to the end of 2000, 
providing an average follow-up time of 13.4 years. Genotypings were carried out for 

20 approximately 1 600 men, resulting to over 2 1 ,000 person-years of follow-up. 

Of the baseline examination participants, 1038 men were re-examined approximately four 
years after the baseline survey, in 1991-3. The mean follow-up time was 4.1 years. Of 
1 177 eligible men, 139 could not be contacted or refused to participate, 1038 (88.2%) men 
25 participated. 

A nested case-control set was selected consisting of 47 men who developed a MI by the 
end of 2000 and 47 control men matched for age, place of residence, fasting time and 
examination day, who had no MI by the end of 2000. Both the cases and the controls had 
30 no MI prior to the 1991-3 examination. Similarly, a case-control set of 22 men who had a 
stroke during the follow-up and 22 identically matched controls were selected. Neither 
group had a previous stroke prior to the 1991-3 examination. A large number of 
genotypings were carried out in these nested case-control sets. 
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Data on CHD and cerebrovascular disease during the follow-up were obtained by 
computer linkage to the national computerized hospital discharge registry. Diagnostic 
information was collected from the hospitals and all heart attacks and cerebrovascular 
events were classified according to rigid predefined criteria. The diagnostic classification 
5 of acute coronary events was based on symptoms, electrocardiographic findings, cardiac 
enzyme elevations, autopsy findings and the history of CHD. Each suspected coronary 
event (ICD-9 codes 410-414 and ICD-10 codes 120-125) was classified into 1) a definite 
acute myocardial infarction (AMI), 2) a probable AMI, 3) a typical acute chest pain 
episode of more than 20 minutes indicating CHD, 4) an ischemic cardiac arrest with 
10 successful resuscitation, 5) no acute coronary event or 6) an unclassifiable fatal case. The 
categories 1) to 3) were combined for the present analysis to denote MI. Cerebrovascular 
events were classified according to the FINNMONICA criteria. 

The purpose of this project was to develop a simple gene test that can be used to diagnose 
1 5 CHD and cerebrovascular disease and to predict the risk of acute myocardial infarction and 
stroke in healthy and sick persons. We had several data sets available to us for this work. 
The model was constructed in a prospective nested case-control set of 50 men who did not 
have prior MI but developed an MI during a 8-year follow-up, and 50 age-matched control 
men who did not develop MI during the follow-up. This case-control set was derived from 
20 the KIHD 1991-3 examination, in which over 1000 men aged 46-64 from Eastern Finland 
years were examined (see ref. 4). We typed over 100 mutations assumed to be relevant 
regarding CHD and stroke in DNA samples obtained at baseline, and collected phenotypic 
information yielding over 5000 variables. 

25 Of the about 100 mutations, the four most predictive ones of MI were selected using 

hierarchial step-up binary logistic modelling (Table 1). These predicted 61% of future Mis 
(R square 16%). Theoretically (based on twin studies), this is the maximal prediction that 
can be achieved by genes. The second step was to find the most predictive other variables. 
We tested similarly over 1000 variables including all known risk factors for CHD. A set of 

30 six variables (Table 1) was defined that increased the prediction to 80% (R square 53%), 
and the predicted probability of MI for each person varied from 0.0002 to 0.9991. These 
can be recorded using five simple questions and measuring waist and hip circumferences. 
None of the over 200 biochemical measurements tested contributed much additional 
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information to the model. The same concerned blood pressure and other clinical 
measurements. Age and gender are additionally needed in the model. 

We also constructed a 3-gene model which with four questionnaire variables predicted 
5 80% (R square 55%) of cerebrovascular strokes (Table 2). 

Thus, we invented a 10-variable model that predicted future myocardial infarction and a 7- 
variable model that predicted stroke very well in the data set they were derived of. The 
prediction of 80% is higher than in any published epidemiologic cohort study. An 
10 advantage is that only a small number of genotypings need to be carried out and a very 
short self-administered questionnaire needs to be filled in. One of the mutations in both 
tests is the same, so in total only six genotypings are needed to predict both MI and stroke. 
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